PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa14g036100.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HB-other
Protein Properties Length: 1730aa    MW: 194428 Da    PI: 5.0131
Description HB-other family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa14g036100.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox60.52.7e-1945100257
                     T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
        Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                      kR+  t+ qle+Le+++ +++yps+++r++L++kl+Lt+rq ++WF+ rR k+kk
  Csa14g036100.1  45 PKRQMKTPFQLETLEKVYSEEKYPSEATRADLSDKLNLTDRQLQMWFCHRRLKDKK 100
                     69****************************************************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.603.0E-1921103IPR009057Homeodomain-like
SuperFamilySSF466892.4E-1636101IPR009057Homeodomain-like
PROSITE profilePS5007116.53641101IPR001356Homeobox domain
SMARTSM003895.0E-1843105IPR001356Homeobox domain
PfamPF000467.4E-1745100IPR001356Homeobox domain
CDDcd000861.58E-1446100No hitNo description
SMARTSM005714.6E-24563622IPR018501DDT domain
PROSITE profilePS5082718.031563622IPR018501DDT domain
PfamPF027919.4E-18564619IPR018501DDT domain
PfamPF050669.1E-16745812IPR007759HB1/Asxl, restriction endonuclease HTH domain
PfamPF156129.1E-7945986IPR028942WHIM1 domain
PfamPF156134.5E-1311211193IPR028941WHIM2 domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1730 aa     Download sequence    Send to blast
MEMGSDEEED HQIRSVADVV GGGGGSSNNK KKNKIDNSSS SSAKPKRQMK TPFQLETLEK  60
VYSEEKYPSE ATRADLSDKL NLTDRQLQMW FCHRRLKDKK DDQSQSKTPP VKPAVPPVRP  120
PPPALASSVH DLPPARSVPE QDSGSGSDSG SGCSPYSDSR RNFASGSSSS RAELDEYETT  180
MGKPSYEPRL SAMVRRAIVC IEAQLGEPLR DDGPILGMEF DPLPPGAFGT PIAMQKHLLH  240
PYESKMYEPH DVRPRRSQAA ARSFHEQQSL DDPSSFTPEM YGRYSENHAH GMDYEIARPR  300
SSSFMHENGS LPRSYGTPGY VSRNCSTSQQ DMPSPIVASA HRGDRFLMEK DSSVLRTEDP  360
YMLSDGVRKS NDVHRKGKIH DVRLGRGSET RENRGPKDLE KLEIQKKKNE ERMRKEMERN  420
ERERRKEEER LMRERIKEEE RLQREQRREM ERREKFLQRE NERAEKKKQK EEIRREKDAI  480
RRKIAIEKAT ARRIAKESMD LIEDEQLELM DLAAINKGLP SVLQLDHDTL QNLELYRDSL  540
STFPPKGLQL KMPFTISPWK DSDESVGNLL MVWRFLTSFS DVLDLWPFTL DEFIQAFHDY  600
DSRLLGEIHV TLLRSIIRDI EDVARTPFSG IGNNQYTTAN PEGGHPQIVE GAYAWGFDIR  660
SWKKNLNPLT WPEILRQLGL STGLGPRLKK KNSRLTHTGD KDEAKGCEDI ISTIRSGSAA  720
ESAFALMREK GLLAPRKSRH RLTPGTVKFA AFHVLSLEGS KGLTVLELAD KIQKSGLRDL  780
TTSKTPEASI SVALTRDVKL FERIAPSTYC VRAPYVKDPA DGKAILADAR KKIRAFESGL  840
TGPEDVNDLE RDEDFEIDID EDPEVDDLAT LASASKGADL GEANVFSGKG GDTMFCDVKA  900
GVKSEIEKEF SSPPPSSIKS IVPQHNERLK DTAVGCLDAM VDESNEGQSW IQGLTEGDYC  960
HLSVEERLNA LVALVGIANE GNSIRSGLED RMEAANSLKK QMWAEAQLDN SCMRDVLKLD  1020
FQNLASSKTE STTGLPIIQS ANRERDNFGG DPSELLDETK PLEVVSNDLQ KSTAERGLII  1080
NQEANISQEN CSFQQGYASK RSRSQLKSYI GHKAEEVYPY RSLPVGQDRR HNRYWLFAAS  1140
ASKSDPSSGL LFVELHDGKW LLIDSEEAFD TLVASLDMRG IRESHLRIML QKIEGSFKEN  1200
ARKNMKLARN PFLKEKSVMN HSPSDSVSPS SAVSGSNSDS METSNSIRVE LGRNDTEKKS  1260
LSKRFHDFQR WMWTETYSSL PSCAKKYGKK RSELLATCAL CFASYLSEYT HCTSCHQRSD  1320
MVDGSEILDS GLTVSPLPFG VRLLKPLLVF LEASVPDEAL ESFWTEDKRK MWGFRLNASS  1380
SPEELLQVLT SLESAIKKEY LSSNFMSAKE LLGVGDANVD DPGSVDVLPW IPKTVSAVAL  1440
RLSELDASII YVKPEKPDLI PEDENEQISL FPGDSLFKGK GPREQEDKDE VVPNLGNRRS  1500
NKRARVSLGS GSNKKVKRKK AQGGPNRFVV SRRNVAVDNN LMSMELNHQV PGRGKRTVRK  1560
RPERINEDND HIVNRMADIV RPKSQEVEED EEEEEQTFRD IDEDWAAGET PREMDDDWAN  1620
ETPNRMMTPM QVDDESDNSV GVESEDDDVD GQFVDYSQRN KWGLDWNSNP NEAAMEDEEE  1680
EEVVGVERVE GEDDAEMSES SEDDDDVPAN NAANNYDRES EGGYSSSDS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
193100RRLKDKKD
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0101550.0AC010155.3 Genomic sequence for Arabidopsis thaliana BAC F3M18 from chromosome I, complete sequence.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010460638.10.0PREDICTED: uncharacterized protein LOC104741454 isoform X1
SwissprotF4HY560.0RLT1_ARATH; Homeobox-DDT domain protein RLT1
TrEMBLD7KCW80.0D7KCW8_ARALL; HB-1
STRINGfgenesh2_kg.1__3015__AT1G28420.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM83472236
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G28420.10.0homeobox-1